Job Radar. Live notifications. AI processed.
upwork.com 2026-04-14 🟡
🔹 Extract data from 1,000,000+ Trustpilot pages
👤 Client: 🇫🇷 France Member since 2025-09-19
💰 Price: $50
🚩 Problem: Scrape and structure large-scale data from Trustpilot to derive insights.
📦 Existing: [Not specified]
Specifications:
[Target] Extract data from 1,000,000+ Trustpilot pages
[Method] Implement a scalable scraping solution with anti-bot mechanisms
[UI/UX] Not applicable
[Stack] Python (Scrapy or BeautifulSoup), Selenium, Proxy services
[Security] Ensure compliance with Trustpilot terms of service; use secure proxies and handle data securely
[Format] CSV for clean, deduplicated dataset
Workflow:
Set up a scalable scraping environment using Python libraries like Scrapy or BeautifulSoup.
Implement anti-bot mechanisms to handle rate limits and avoid detection by Trustpilot's systems.
Extract required fields: Trustpilot URL, domain name (reviewed website), rating score, email (if available).
Store data temporarily in a structured format for deduplication checks.
Deduplicate the dataset based on unique identifiers.
Export clean, deduplicated dataset in CSV format.